Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions

نویسندگان

  • Heewon Seo
  • Yoomi Park
  • Byung Joo Min
  • Myung Eui Seo
  • Ju Han Kim
چکیده

The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultrarare variants reported by the Ion Proton sequencer from 27 whole-exome sequencing data but that are not present in either the 1000 Genomes Project and the Exome Aggregation Consortium. We classified positive variant calls into 393 highly likely false positives, 126 likely false positives, and 156 likely true positives, which comprised 58.2%, 18.7%, and 23.1% of the variants, respectively. We identified four distinct error patterns of variant calling that may be bioinformatically corrected when using different strategies: simplicity region, SNV cluster, peripheral sequence read, and base inversion. Local de novo assembly successfully corrected 201 (38.7%) of the 519 highly likely or likely false positives. We also demonstrate that the two sequencing kits from Thermo Fisher (the Ion PI Sequencing 200 kit V3 and the Ion PI Hi-Q kit) exhibit different error profiles across different error types. A refined calling algorithm with better polymerase may improve the performance of the Ion Proton sequencing platform.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Single nucleotide variant sequencing errors in whole exome sequencing using the Ion Proton System

Errors in sequencing are a major obstacle in the interpretation of next-generation sequencing (NGS) results. In the present study, sequencing errors identified from analysis of single nucleotide variants (SNVs) identified during exome sequencing of human germline DNA were studied using the Thermo Fisher Ion Proton System. Two consanguineous cases were selected for sequencing using the AmpliSeq ...

متن کامل

Editorial: The Post-Exome Era

The Iranian Rehabilitation Journal (IRJ) invites research papers on the genetic basis of single gene and complex disorders. This vastly dynamic branch of science will complement the multidisciplinary wealth of expertise in the fields of social welfare and rehabilitation. The past few years have witnessed outstanding research projects on the genetic causes of numerous debilitating disorders, suc...

متن کامل

Evaluation of variable relative biological effectiveness and the creation of homogenous biological dose in the tumor region in helium ion radiation to the V79 cell line

In radiation therapy, ions heavier than proton have more biological advantages than a proton beam. Recently, ion helium has been considered due to high linear energy transfer (LET) to the medium and a higher relative biological effect (RBE). To design the spread-out Bragg peak (SOBP) of biological dose for radiation with any type of ion, we need exact values of RBE, which is dependent to dose, ...

متن کامل

Correction: OTG-snpcaller: An Optimized Pipeline Based on TMAP and GATK for SNP Calling from Ion Torrent Data

Because the new Proton platform from Life Technologies produced markedly different data from those of the Illumina platform, the conventional Illumina data analysis pipeline could not be used directly. We developed an optimized SNP calling method using TMAP and GATK (OTG-snpcaller). This method combined our own optimized processes, Remove Duplicates According to AS Tag (RDAST) and Alignment Opt...

متن کامل

Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants

Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2017